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Dear Editor, 

Drosophila tudor is a maternal effect gene required 
for germ cell formation and abdominal segmentation 
during oogenesis [1,2]. It encodes a large protein, Tudor 
(Tud), of 2 515 amino acids, containing 1 1 copies of a 
~60-residue sequence motif, termed the tudor domain. 
Tudor domains are best characterized by their methyl- 
lysine and methylarginine binding abilities [3-5]. Bio- 
chemically, Tud interacts with Aubergine (Aub), a Piwi 
family protein, in a manner dependent on symmetrically 
dimethylated arginine (sDMA) residues located at the 
N-terminal end of Aub [6-8]. The sDMA-dependent in- 
teraction between Tud and Aub is part of a broad range 
of phenomena involving tudor domain and Piwi family 
proteins, in species ranging from fruit flies to mammals 
[3]. We and others have shown previously that domains 
7-11 of Tud (Tud7-ll) were necessary and sufficient for 
germ cell formation and interaction with arginine -meth- 
ylated Aub [9-11]. The structure of Tud 11 in complex 
with sDMA-Aub peptides, together with the structure 
of human SND1 (TDRD11) bound to sDMA peptides 
from PIWIL1, uncovered the composition of the sDMA- 
binding pocket, which consists of a cage of four aromatic 
residues and an asparagine [10, 12]. Interestingly, the 
sDMA-binding tudor domains are rigidly embedded in 
a structural module termed the extended Tudor domain 
(eTud), which comprises an additional juxtaposed OB- 
fold domain [13, 14]. It is likely that all tudor domains in 
Tud7- 1 1 have an eTud fold (Supplementary information, 
Figure SI A) [10]. However, apart from Tudll, the struc- 
tures and sDMA binding properties of tudor domains are 
unknown. More importantly, a mechanistic understand- 
ing of Tud's molecular functions in germ plasm requires 
the knowledge of the spatial organization of the tandem 
tudor domains as a whole. 

To learn the overall structure of Tud7- 1 1 and the fea- 
tures of individual structural units, we expressed Tud7- 
1 1 (aa 1 6 1 7-2 515) in E. coli and probed its domain or- 
ganization by limited proteolysis. Digestion of Tud7-ll 
with V8 (Glu-C) protease produced two smaller, stable 
bands corresponding to Tud9-ll and Tud7-8, as deter- 
mined by N-terminal sequencing (Figure 1A). However, 



crystallization attempts of recombinant Tud7-ll and 
Tud9- 1 1 were unsuccessful despite many efforts. We did 
crystallize a recombinant Tud 10- 11 fragment (aa 2 163- 
2 515) and solved a 3.0 A structure (see Supplementary 
information, Table SI). The structure shows that, like 
Tudll, Tud 10 is also embedded in a Tudor-SN/p300- 
like OB-fold scaffold, forming an eTud unit (Figure IB). 
The two eTud modules have a similar overall fold, with 
a root-mean-squared (r.m.s.) deviation of 1.81 A. Com- 
parison of the present Tudll structure with the previ- 
ously determined peptide-bound Tudl 1 structure (Tudl 1- 
pep) gives rise to an r.m.s. deviation of 1.29 A (Supple- 
mentary information, Figure SIB). The aromatic cage 
involved in binding sDMA in Tudll is not conserved in 
TudlO: Ser2220 and Leu2245 in TudlO take the place 
of Phe2403 and Phe2427 in Tudll, respectively (Figure 
1C and Supplementary information, Figure SI A). Super- 
position of the TudlO and Tud 11 -pep structures reveals 
two deleterious effects of Ser2220 for sDMA binding: 
1) it lacks cation-;: interactions with sDMA; 2) its small 
size allows Tyr2227 to move closer, making the pseudo 
sDMA-binding pocket too small to accommodate sDMA 
(Figure 1C). It appears that Leu2245 is less harmful for 
sDMA binding, as it can maintain hydrophobic interac- 
tion with the methyl group. Consistent with the structural 
observation, we detected no interaction between recom- 
binant TudlO and methylated Aub peptides by ITC. 

The Tudl0-ll structure revealed for the first time 
the packing mode of methylarginine-binding tandem 
eTud domains. In the structure, the two eTud modules 
are rigidly juxtaposed in a head-to-head manner (Figure 
IB). The packing buries a total surface area of 1 007 A 2 , 
which indicates a moderate binding affinity between the 
two eTud modules. A majority of the sidechain contacts 
are between residues located in the OB-fold domains, 
and they are mostly polar interactions. In particular, 
Glu2305 and Tyr2326 each forms a hydrogen bond with 
Arg2454, and Arg2323 forms two hydrogen bonds with 
Asp2369 (Supplementary information, Figure SIC). An 
E2305R/R2323E/Y2326A triple mutation results in an 
altered spatial organization between TudlO and Tudl 1, as 
evidenced by the change of sedimentation coefficients in 
analytic ultracentrifugation experiments (Supplementary 
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information, Figure SID). These features are consistent 
with the properties of two independently folded, mod- 
erately associated eTud modules that can be separately 
expressed and purified, even crystallized in the case of 
Tudll [10]. 

We also crystallized and solved a 2.5 A structure of 
Tud9 (aa 1 978-2 160), which shows that it also forms 
an eTud unit (Figure ID and Supplementary informa- 
tion, Table SI). Superposition of Tud9 with Tudll of the 
Tudl 1-pep complex yielded an r.m.s. deviation of 1.9 A. 
Major differences between Tud9 and Tudll include: 1) 
the conformation of the loop connecting aA and P3 in 
the tudor domain; 2) the presence of an additional short 
helix, aC, in the OB-fold domain of Tud9; and 3) an 
extended aD in Tud9 (Supplementary information, Fig- 
ure S1E). Despite these differences, the putative sDMA- 
binding pocket in Tud9 is relatively well conserved 
(Figure IE). One main difference is between Leu2058 of 
Tud9 and Phe2427 in Tudll. ITC measurements of the 
binding of the wild-type and the L2058F mutant of Tud9 
to sDMA peptides of Aub revealed dissociation con- 
stants (Kd) in the sub-millimolar range for both proteins 
(Supplementary information, Figure S1F). The weak but 
robust interactions indicate that: 1) the putative sDMA- 
binding pocket is an authentic one; 2) the insensitivity 
to the flanking sequences of sDMA peptide suggests 
that either Tud9 is a promiscuous sDMA binder or that 
Aub is not its true binding partner; and 3) an aromatic 
residue at the position occupied by Leu2058 is not nec- 
essary for sDMA binding — a hydrophobic residue is 
most likely sufficient. Both Tud7 and Tud8 have intact 
sets of sDMA-binding residues, and ITC analyses indi- 
cate that Tud7-8 can simultaneously bind two ariginine- 
methylated Aub peptides with a Kd value of 10 u,M 
(Supplementary information, Figure S1G). In compari- 
son, an isolated Tud8 binds Aub peptides with Kd values 



ranging from 14 to 32 uM, depending on the location of 
the sDMA residue within the Aub peptide (Supplementary 
information, Figure S 1 G). 

To obtain an overall understanding of the domain 
organization in Tud7-ll, we performed solution small 
angle X-ray scattering (SAXS) experiments to determine 
the protein envelope. Ab initio modeling using the SAXS 
data yielded an L-shaped envelope of Tud7-ll (Figure 
IF and 1G). Three lobes of the envelope are clearly vis- 
ible, one elongated lobe connected via a thin neck to a 
bilobal, calabash-shaped body. Based on the proteoly- 
sis result and the structural information, the globular, 
calabash-shaped body can be unambiguously assigned to 
Tud9- 1 1 , with Tud9 being the smaller lobe, connected to 
the larger lobe of Tudl 0-11 (Figure IB, ID and 1G). The 
size of the elongated lobe is capable of accommodating 
two eTud modules. Crystallization of Tud7-8 or Tud7 
and Tud8 separately was not successful despite many ef- 
forts. We then generated the structures of Tud7 and Tud8 
by homology modeling using the Tud9 structure as the 
template. The crystal structures of Tud9, Tud 10-11, and 
the modeled Tud7 and Tud8 structures were then placed 
into the SAXS-generated envelope using a combined 
approach of computational docking and manual adjust- 
ment (Figure 1 G and Supplementary information, Figure 
S1H). The low-resolution, approximate model of Tud7- 
1 1 conforms to the molecular envelopes well, and it can 
account for known biochemical properties of Tud7-ll. 
The Tud7- 1 1 model structure shows that all sDMA-bind- 
ing sites are well spaced and accessible, with the sDMA- 
binding sites of Tud7, Tud9 and Tudll located on one 
side of the protein surface (Supplementary information, 
Figure SI H). 

The domain organization of Tud7-ll provided first 
evidence that certain eTud modules may pack into a 
compact structure. This finding is biologically signifi- 



Figure 1 Structure of Tud7-11. (A) Domain organization of Tud7-11. Top, a schematic diagram showing distribution of eTud 
modules based on sequence alignment and secondary structure analyses. The domains and their spacing were approximate- 
ly drawn in scale. Bottom: time course of limited proteolysis with V8 protease generating two stable fragments corresponding 
to Tud7-8 and Tud9-11. (B) Overall structure of Tud10-11 shown as a ribbon model superimposed with a semi-transparent 
surface representation. Regions spanning the canonical tudor and OB-fold domains are enclosed in red and brown dashed 
ovals, respectively. Residues at the putative sDMA-binding pockets are indicated in green. (C) Superposition of the putative 
sDMA-binding site of Tud10 with that of Tud11 in the Tud11-pep complex. The involved residues are shown in a stick model (light 
blue for Tud11 and green for Tud10). The Aub peptide bound to Tud11 is shown as a reference, with the sDMA residue shown 
in a stick model (brown). (D) Overall structure of Tud9. (E) Comparison of the sDMA-binding pockets of Tud9 and Tud11. The 
Aub peptide bound to Tud 11 is shown in brown, with the sDMA residue shown in a stick model. (F) Experimental SAXS curve 
from Tud7-11 in solution. Green dots represent the experimental SAXS data points. Blue dashed line represents the scatter- 
ing curve computed from the ab initio envelope. The inset at left bottom corner shows the distance distribution functions p(r). (G) 
Molecular envelope of Tud7-1 1 . The ab initio low-resolution SAXS-derived envelope is shown in a sphere representation. The 
crystal structures of Tud9, Tud10-11, and the simulated models of Tud7 and Tud8, all shown in a ribbon representation, were 
docked into the envelope as rigid bodies. 
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cant, as the packing creates asymmetry among individual 
Tud units that may affect their ligand binding proper- 
ties, as exampled in methyllysine-binding tandem tudor 
domains. A different domain organization was observed 
for the four-tudor domain fragment of murine TDRD1, 
which showed a flexible, extended conformation from 
a SAXS study [15]. The extended vs the more compact 
conformation of the tandem tudor domains would place 
the binding partners of individual tudor units at different 
geometric arrangements, constraining the interactions of 
binding partners with the tudor domain proteins, as well 
as those between themselves and with downstream tar- 
gets, hence determining the biological properties of the 
tudor domain proteins. In Tud7-1 1, the bulky Tud9-1 1 is 
connected to the elongated Tud7-8 via a thin hinge, sug- 
gesting that the two structural ensembles have a flexible 
relative positioning, in agreement with the domain map- 
ping result by proteolysis. The model also indicates that 
all sDMA-binding sites are accessible, suggesting that 
Tud7-ll can bind multiple arginine-methylated proteins 
simultaneously. Our previous study showed that muta- 
tions compromising aromatic cages in each of the five 
tudor domains, except Tud8, resulted in defective germ 
cell formation [10]. While it can be argued that ligand 
binding by Tud8 is not essential for germ cell formation, 
it is surprising that mutation in Tud 10 has a detrimental 
effect, as its tudor domain does not have a complete set 
of aromatic residues needed for binding sDMA. In fact, 
localization of the TudlO mutant to the germ plasm was 
unaffected, pointing to the requirement of a separate set 
of intermolecular interactions involving unmethylated 
binding partners for its function in germ cell formation. 
Coincidently, a highly conserved residue, Arg2228, re- 
quired for stabilizing the putative ligand-binding site in 
TudlO, is also needed for Tud function in germ cell for- 
mation [9]. 

In summary, the overall spatial organization of the 
tandem eTud modules of Tud7-ll, together with the 
knowledge of structures and sDMA binding properties 
of individual eTud units, provided new mechanistic in- 
sights into ligand binding properties of Tud in germ cell 
formation, and the information should be useful to guide 
further efforts in identifying new interaction partners and 
aid the understanding of the assembly of polar granules. 
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